04:00
2026-05-28
arxiv.org
artificial-intelligence
Cross-Entropy Games and Frost Training
Researchers introduced Frost Training, a method that improves Monte Carlo-based policy optimization for Cross-Entropy Games by exploiting the gradient of the reward function in embedding space. The te…